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We present a detailed discussion of the role played by memory, and the nature of self-induced 
shocks, in an evolutionary population competing for limited resources. Our study builds on a 
previously introduced multi-agent system [Phys. Rev. Lett 82, 3360 (1999)] which has attracted 
significant attention in the literature. This system exhibits self-segregation of the population based 
on the 'gene' value p (where < p < 1), transitions to 'frozen' populations as a function of the 
global resource level, and self-induced large changes which spontaneously arise as the dynamical 
system evolves. We find that the large, macroscopic self-induced shocks which arise, are controlled 
by microscopic changes within extreme subgroups of the population (i.e. subgroups with 'gene' 
values p ~ and p ~ 1). 



I. INTRODUCTION 



The dynamical behavior of a population of objects or 
'agents' (e.g. software or hardware modules, cellular or- 
ganisms such as bacteria or viruses, human beings, ani- 
mals) is of interest across a range of disciplines. Physics is 
arguably luckier than most disciplines in that the 'agents' 
of interest (i.e. particles) do not adapt their behavior ac- 
cording to past failure, hence evolving new sets of rules 
as time progresses. Nor do the agents in question have 
any individual memory. Biological and social disciplines 
are not so lucky. Through a desire to develop a min- 
imal model which could incorporate such features into 
a manageable yet non-trivial system, Arthur introduced 
the so-called 'El Farol' Bar Problem, which concerns the 
repeated competition between bar-goers to attend a pop- 
ular bar with limited seating [|. Challet and Zhang 
subsequently introduced a binary version of this bar 
problem for the case where the amount of resource (e.g. 
number of seats) is just less than half the number of 
agents (e.g. possible attendees). This system is referred 
to as the Minority Game. 

The Minority Game docs not allow an agent to contin- 
uously evolve new strategies and hence explore the entire 
strategy space. The Minority Game is also essentially de- 
terministic, apart from occasional coin-tosses which are 
used to break ties in strategy scores. Furthermore the 
resource level is set at just less than half the number of 
agents, so that there are always more losers than winners. 
To help overcome these limitations, Johnson et al. intro- 
duced a stochastic version of the Minority GameQ which 
is subsequently referred to as the Genetic Model, in which 
an agent's strategy (characterized by a 'gene' value p) can 
evolve indefinitely in time, and is in principle allowed to 
access the entire space of strategies (i.e. all p values). 
The resulting Genetic Model has provoked much interest 
in the literature (for example see Refs. 0, 0, IE S H ) • 
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As commented in the original paper of Johnson et al.Q 
and confirmed by Burgos et al. 0, , the self-segregation 
observed in the Genetic Model is insensitive to changes 
in an agent's memory length m. 

In this paper we present a detailed discussion of the 
role of memory in the Genetic Model. We also explain 
the origin of the remarkable step-like structure in the 
global output time series as a function of the resource 
level, which was first observed in by Johnson et al. in 
Ref. Q. We then introduce (Sec. IIII|) a new variant of 
the Genetic Model in which the number of agents com- 
peting at a given time step, is allowed to fluctuate. Be- 
cause of the analogy with the Grand Canonical Ensem- 
ble in physics, we shall refer to this model as the 'Grand 
Canonical Genetic Model' (GCGM) By considering ver- 
sions of the GCGM both with and without memory, we 
shall investigate the endogenous (i.e. self-induced) large 
changes which arise in the system. These large changes 
represent abrupt macroscopic 'shocks', and occur with 
a greater probability than would be expected based on 
random walk statistics. We provide a detailed analysis 
of the mechanism that generates these large changes. 



II. EFFECTS OF MEMORY IN THE GENETIC 
MODEL 

Various papersd S S have made claims with re- 
gard to the role of memory in the Genetic Model. To 
date though, no one has performed a detailed analy- 
sis of this problem. In this section we present such an 
analysis which involves comparing the behavior of the 
original model with that of a memorylcss variant. The 
results presented here for the Genetic Model are remi- 
niscent of earlier results for the Minority Game. In par- 
ticular, Hart et al.^lj showed that a crowd-anticrowd 
theory which assumes random history, provides a quan- 
titative description of the time-averaged fluctuations in 
the Minority Game. Subsequently Cavagna ^lj demon- 
strated numerically that the time-averaged fluctuations 
were indeed largely unaffected if the global history was 
replaced with randomly generated data. 
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Number of agents: N = 501 Memory length: m = 4 
Death score: D = 4 Mutation range: r = 0.2 



TABLE I: Parameters used to generate numerical data. 

Ceva and Burgos |f| investigated explicitly the role of 
memory in the Genetic Model, however the results are 
restricted to a comparison of the gene value distributions 
in the minority case, in which the amount of resource is 
just less than half the number of agents. In contrast, in 
this section we shall treat the general case, in which the 
amount of resource is unrestricted, and provide some the- 
oretical analysis to explain the differences that we shall 
observe between the two models. We shall show in Sees. 
Ill Gl and 111 HI that the observation that the gene value 
distributions are identical only holds in the special case 
considered in Ref. ||. Furthermore, we shall show that 
the feedback introduced by the existence of memory does 
influence the behavior of the model in that it controls the 
time-average of the prediction (Sec. Ill G|) and introduces 
autocorrelations into the mean attendance time series 
(Sec. Ill If) . By comparing and contrasting the memo- 
ryless Genetic Model with the original model we shall 
be able to make some important observations about the 
memory's true significance. We shall also answer some 
important questions as to the extent to which memory is 
of benefit to the agents and the system as a whole. 

Before going on to describe the Genetic Model, we note 
that all of the numerical results presented in this paper 
were obtained using the model parameters listed in Table 
HI unless otherwise stated. Similarly all time averages are 
taken over the period 10000 < t < 60000. The first 10 
000 time steps being neglected to allow any transients 
due to the initial conditions to die away. 



A. Original Genetic Model 

In this section we present expressions for some of 
the most basic quantities in the original Genetic Model, 
which includes the memory. In Sec. Ill Bl wc shall consider 
the equivalent expressions in the memory less variant. We 
start with a brief summary of the Genetic Model. Fuller 
details are given in Ref. jj]. 

The Genetic Model consists of a population of agents 
who must decide at every time step between two possible 
choices. We shall refer to the decision of an agent i as 
its action, a^t- Each agent is defined by a gene value, 
Pi-t, which can take any value < pi-t < 1. At each time 
step the model makes a prediction, h t , of the outcome of 
the time step available to all of the agents. Each agent 
chooses its action to be equal or opposite to this predic- 
tion with probabilities pi- t and 1 — p^t respectively. h t 
is calculated based on a global memory that the model 
maintains of the outcome of the previous m time steps 
and the assumption that patterns that have occurred in 



the time series of these outcomes in the past will recur 
in the future. The parameter m is known as the mem- 
ory length. The outcome of each time step is determined 
based on the actions of all of the agents. From now on 
we shall refer to the outcome as the global action at time 
t, A t . 

Agent gene values are not constant with time. Each 
agent maintains a record of its score, Si :t , which deter- 
mines when it changes its gene value. At every time 
step, Si-t increases by one unit if ai-t = +A t and de- 
creases otherwise. If s^t = —D then the agent mutates. 
The parameter D is known as the death score. When 
an agent mutates it chooses a new gene value at random 
from a range of values of width 2r centered on the old 
gene value. The parameter r is known as the mutation 
range. 

Let the possible values taken by h t be —1 and +1. 
Agent i makes a choice to follow the prediction (a^t = 
+ht) with probability pi- t or to refute it (<ij ;t = — h t ) with 
probability 1 — pi-t- There are two opposite definitions of 
the global action A t used in the literature. For example, 
Ref. Q defines A t , by analogy with Zhang and Arthur's 
Bar model, to be the state of the bar at time t. Thus, 
A t = +1 would denote an overcrowded bar and the opti- 
mal action of each agent would be to stay at home (i.e. 
Oi-t = —1). However, in this paper we shall adopt the 
convention of Ref. |j| whereby A t represents the optimal 
decision of each agent at time t. The global action A t is 
given, in terms of a model parameter I, which can take 
values < I < 1, by: 



-4, 



+ 1 nf 1 < Nl 
-1 n t +1 > Nl 



where: 




(1) 



(2) 



In other words, nf is the number of agents for which 
ai-t = +1. We shall refer to I as the resource level. 

Let {pi-t}i denote the set of values of p^t for all agents 
at time t. It is easy to show that the ensemble average 
number of agents following the prediction (nf ht ) is given 
by: 



NPt 



where: 



Pi 



1 N 



(3) 



In equilibrium, where the population evolves such that p t 
is approximately constant, the ensemble average (nf hl ) 
and the time average (n 



+h t 



(nt ht )t 



>t will coincide. Thus: 
* N(p t )t ■ (4) 



B. Memoryless Genetic Model 

In this section we introduce a memoryless variant of 
the original Genetic Model. In contrast to the original 
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model described above, where the gene value of the zth 
agent Pi- t gives the probability of it choosing to follow 
the prediction (a^t = +h t ), in the memoryless model 
Pi-t gives the probability that cii-t = +1 directly. With 
this modification the prediction ht and hence the global 
memory that produce it become redundant and can be 
removed from consideration. The agents in this variant 
are memoryless, by which we mean that their actions a^t 
at time t are independent of the state of the model at 
earlier times. a^ t is dependent only on p i]t . An equivalent 
way of considering this is to take h t = +1 V t. The global 
action A t and the number of agents attending the bar nf l 
are unchanged and so are given by Eqs. Q and as 
before. However, Eq. Q becomes: 



For h f 



N(p 



(5) 



C. Comparison of the performance of the original 
and memoryless models 

We shall now compare the performance of the two mod- 
els. In order to quantify performance we define bit to be 
the total number of points scored by all of the agents 
at time t. Therefore we are considering the performance 
of the system as a whole, rather than that of individual 
agents. If we consider the models to be analogous to an 
economic system then the question that we are investi- 
gating becomes, to what extent can the agents in this 
system exploit the potential wealth available to them as 
a population. Note that in Ref. [(| Burgos et al. treat 
the memoryless Genetic Model in terms of a cost function 
given by the second moment of nf 1 with respect to Nl. 
However this cost function is symmetric in that it assigns 
an equal cost to deviations of nf 1 from Nl of opposite 
signs. As we shall see, Ut is not symmetric about Nl 
and hence can distinguish between positive and negative 
deviations. 



1. Original Genetic Model 

First we shall derive expressions for Lit in the model 
with memory. Later we will see how these expressions 
are modified in the absence of the memory. From Eq. 
(1} the condition that A t = +1 is: 



n+ h> < Nl 



> N(l - I) 



if h t 
if h t 



-1 
1 . 



(6) 



Now consider the total number of points scored by the 
agents, Ut- Agents for which a.; ;f = +A t will gain one 
point whereas agents for which a^-t = —A t will lose. If 
ht = +1 and A t = +1 it will be the 
choose to follow ht who will gain. If A t 



N- 



agents who 
T, then the 

4 agents who choose to refute h t will gain. Thus, 



using Eq. ©, we have: 



U t (nt ht ) 



2n\ 



N if n+ h> < Nl 



-2nj' M +N if 



> Nl 



(7) 



h 



When h t = -1 and A t = +1 it will be the N - nl 
agents who choose to refute the prediction who will gain 
and vice- versa for A t = — 1. Thus, for h t — — 1 the above 
expression becomes: 

For h t = -1: 



Ut{nt ht ) 



-2n~i 



+h t 



N 



iint ht > N(l - I) 



2n 



+h t 



N if 



<N(l-l) 



(8) 



The expressions in Eqs. [7|and|Hlare plotted in Fig. ^ 
The black and white circles represent the value of nf ht 
at which Ut is a maximum for h t = +1 and ht = — 1 
respectively. We shall call the value of nf ht at which Ut is 
a maximum the optimal value and denote it by opt[n 
From Fig. Q]we can see that opt^^*] is given by 
For I < 0.5: 



opt[n+ ht 



For I > 0.5: 




opt [5 



,+h t ] _ 



Nl 

NO- -I) 



if h t 
if h t 



if ht = 
if h t = 



(9) 



(10) 



The most important feature of this equation to recognize 
is that, in general, there is no unique value of opt[i 



independent of t. Only if h t 
a unique solution exist. 

We note in passing that since 



1 or — 1 V t would such 



N 



,+1 



optimal value of n~l in the original model is: 
opt[n^] = Nl . 



the 



(11) 



In contrast to opt[n^ *], the optimal value of is inde- 
pendent of t. Nevertheless the most important quantity 
for the analysis that we present here is opt[n t f,lt ] in the 
case of the original model since, as we shall see later, it 
is the value of nf ht that the agents can directly control 
and not that of nf 1 . 

Ref. demonstrated the existence of so-called frozen 
regimes which exist when / lies outside of the region 
bounded by two critical values, which we shall label here 
l c l and l C 2- These regimes were described as quenched 
by Burgos ct al. in Ref. |2J. The frozen regime obtains 
when I < l c i or / > The behavior of the Genetic 
Model in these regimes is well understood^,!?]], therefore 
we shall restrict ourselves to a consideration of the dy- 
namic regime. In what follows, unless otherwise stated, 
I takes values on the interval l c i < I < Z c2 . Therefore, 
inequalities such as / < 0.5 should be taken as shorthand 
for l c i < I < 0.5. 
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FIG. 1: Global utility as a function of nf ht in the original Genetic Model with memory. The dashed lines indicate Ut when 
ht — — 1, the narrow lines Ut when ht = +1. The emboldened lines represent those part of the line defined by Ut which are 
invariant under ht — ► —ht. The black and white circles represent the values of nf ht at which Ut is a maximum when ht = +1 
and —1 respectively. 



Compared with the variation of h t , agent mutation is 
a slow process. The value of ht changes on a timescale 
of At ~ 1 while agents mutate on a time scale At ^> D. 
The result of this is that we should not expect the agents 
to be sensitive to the instantaneous value of lit given in 
Eqs. and (JSJ). They will only be sensitive to the time 
average (U t )t- Translated to the conventions used here, 
Ref. H found that: 



(ht 



+0.5 I > 0.5 
-0.5 I < 0.5 



(12) 



Therefore, for I < 0.5, h t = +1 for a fraction 0.25 of the 
time steps whereas the fraction is 0.75 for I > 0.5. Thus 
we can calculate the following expression for (Ut(n^ ht )) t : 
For I < 0.5: 



For I > 0.5: 



2nt hi - N 

,+ht N 



n? ht < Nl 



Nl < n't * < N(l — I) (13) 
-2n+ ht +N nf ht > N(l-l) 



2n 



+h t 



(U t 



- N 

N 



2nt ht + N n+ nt > Nl 



n+ ht < N(l - I) 

N(l -l)< n+ ht < Nl . (14) 

+h t 



This expression is plotted in Fig. From the figure we 
can see that the optimal value of ri^ ht that maximizes 
(U t (nt ht ))t is given by: 



+h J JV(1 -0-1 ^ < 0.5 

optK '] = 



Nl 



I > 0.5 



(15) 



Note that although we have assumed the values given 
in Eq. (|12f) for (h t )t, the values of opt[nf ht ] given in 



the above equation in fact only depend upon the signs of 
the values of (h t )t- The significance of this will become 
apparent in Sec. Ill Gl 

Let IIj ;t be the probability that nf ht = i. Lo ct al. 01 
demonstrated that IT ;t will be approximately Gaussian 
with mean /i = (n^ ht ) t = Np t and standard deviation 
ct = \/J2iPi;t(^ — Pi;t)- From Eq. (fT5f) it follows that the 
optimal form of IT ;t will obtain if p t is as given by the 
following equation and a = 0: 



opt ^ 



i Z<0.5 
I > 0.5 



(16) 



The term -A> results from the asymmetry of the condition 
in Eq. which determines A t in terms of 



j_ 

N 



For 
I 

and so we can neglect this term . a = if {pi t t}i contains 
only the values and 1. Let P(x) be the distribution of 
{pi,t}i such that NP(x)dx is the probability that if an 
agent i is chosen at random from the set {pi-t}i then 
x < Pi-t < x + dx. The optimal form for P(x) is then: 

P(x) = (1 - o V t[p t ])8(x) + opt||i t ]*(l - x) . (17) 

This represents a distribution which is zero everywhere 
except for peaks at x = and x = 1. The relative heights 
of the peaks being such that \i = opt[p t ]. 

It is well known 0, 0, ^3 that in the long time limit 
where t — > oo the population of agents evolves such that 
P(x) is strongly peaked about x = and x = 1 and p t 
takes the value given by Eq. I|16|) . Although the agents 
never manage to achieve a form such that the standard 
deviation cr(nf ' , t) is exactly zero, they do approach 
the optimal distribution represented by Eq. JT7J. Thus, 
we can see that the population of agents is capable of 
evolving such that IL. t is close to its optimal form and 
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the time series of values of nf 1 contains values clustered 
around the optimal value opt[?i^' lt ] given by Eq. I|15|) . 



2. Memoryless Genetic Model 

In this section we shall see how the above analysis ap- 
plies to the memoryless variant of the model, bit in the 
memoryless model is given by Eq. J7J above. Thus: 



U t (n 



+1 ) = 



2nt 



N 



if nf 1 < Nl 



-2nt +N if ■ 



> Nl 



(18) 



Equation i|18[) is plotted in Fig. [2p. Once again the 
black circle represents the value of nf 1 at which U t is a 
maximum. From the figure we can see that the optimal 
value of nf 1 is now: 



opt[n 




if I < 0.5 
if I > 0.5 



(19) 



,+i 



is independent of t. As before, the optimal form 
of Hi-t will be that for which a = cr(nf ,t) = and 
fj, = (n/~ )t = Np t with p t given by: 



opt [p t 



i + j? a i < o.5 

l if l > 0.5 



(20) 



Note that, from Eqs. (|16f) an d (|20fl . this will mean that 
the gene value distribution of the agents for I < 0.5 in the 
original and memoryless models will not be identical, but 
will be related by the transformation {pi-t}t — * {1— Pi ; t}t- 



3. Direct comparison of the models 



In 



this section we shall compare the value of 
U t (opt[n? ht }) in the original Genetic Model with that of 
btt^optlnf 1 ]) in the memoryless Genetic Model in order 
to establish what effect the memory has on the perfor- 
mance of the model. In both the original Genetic Model 
with memory and the memoryless variant, the agents are 
rewarded based on the value of nf . This is because the 
value of the global action is determined from the con- 
dition on n +1 in Eq. and an agent i gains or loses 
one point depending on whether a^ t = ±A t . There is, 
however, one difference between the two models that will 
be extremely important in what follows. The population 
of agents can control IL, ;t through their effect on p t and 
(7 = \f^2iPi;t{^ — Pi;t)- In the memoryless model IT ;t 
represents the probability distribution for nf l whereas in 
the original model IT ;t represents the distribution func- 
tion for nf 4 . The result of this is that in the memoryless 
model the population of agents can directly control the 
values that occur in the time series of rif 1 whereas in the 
original model they can only control the values of nf ' . 



In the latter case nf 1 will also depend on the value of h t 
over which the agents have no direct control. 

From Eqs. flUf . (fTljl and (fT£|) it follows that the maxi- 
mum values of Ut (which obtain at opt[n^~' lt ] and opting 1 ] 
given by Eqs. I|15|) an d (|19|l 'l are: 
memory: 



U t {opt[n+ ht }) 



no memory: 



^(optK- 1 ]) 




21) - 1 



N -{21 - 1) 



(N(l - 21) 
[N(2l - 1) 



if I < 0.5 

if; > o.5 



if I < 0.5 
if I > 0.5 



(21) 



(22) 



Thus the optimal value of Ut in the original model is ex- 
actly half that achieved by the memoryless model. As 
we suggested above, the reason for this is because in the 
original model nf 1 is a function of both ht and IT ;t . The 
instantaneous optimal value of nf ht will therefore de- 
pend on h t (see Fig. ^and Eq. ©). Note that the value 



Unlike the original Genetic Model, this optimal value of f 



that maximizes the time average (JJt{ n t ')}ti 



opt[n t l_/lt ] given by Eq. I|15fl wm always be one of the 
instantaneous optimal values given in Eq. 0. Thus the 
agents cannot improve the global utility by varying p t . 
They adopt the value of p t that is optimal for the most 
common value of ht , but they must pay the penalty when 
ht takes the opposite value. In contrast, in the memo- 
ryless model the instantaneous optimal value of nf 1 in 
Eq. (|19|) is independent of t. Thus, by evolving such that 
p t = opt[p t ] the agents can insure that nf 1 is close to the 
optimal value at each time step. 



D. Analytical expressions for {n^ 1 )t and ^{nf 1 ^) 



We can use the same method that we used to derive 
the expression for (U t (nf ht )) t in Eqs. l(T3|l and (fTljl to 
obtain expressions for (n^ l " 1 )t and the standard deviation 
^(n^ 1 ,^) of the n^time series. From Eq. Q: 

(n+ ht )=Np t . (23) 

This leads to the following expression for (nf 1 ): 



if ht = -1 
if h t = +1 



(24) 



Taking the time average in exactly the same way as in 
Sec. Ill G II in equilibrium where p t is approximately con- 
stant, yields expressions for (nf l ) t and ((nf 1 ) 2 )^- 



£(3-2<p t ) t ) 
£(l + 2(p t )t) 




-6<Pt>t+3) 
-2&>*+l) 



if Z < 0.5 
if Z > 0.5 

-a 2 if Z < 0.5 
-a 2 if Z > 0.5 



(25) 



(26) 
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FIG. 2: a, The average global utility (Ut(nf lt ))t in the original Genetic Model with memory, b, The global utility Utinf 1 ) in 
the memoryless model. 



where a is the standard deviation of II i;t introduced in 
Sec. Ill Al which will be of order unity. If we assume that 
the agents adopt the optimal distribution in Eq. i|17|) 
then we can take a = 0. We now obtain an expression 
for afaf 1 ^) as follows: 



[a(nt\t)]' 



Taking a = gives: 



a(n+ l ,t) 



([n 
3iV 2 



[<■■ 



tit 



t)t 



(27) 



(28) 



Ref. [J| used a mean-field approximation to derive ex- 
for n 

In particular: 



pressions for nf 1 and afaf 1 ,t) in terms of {ht)t and (p t )t- 



K +1 ) t 



a(nt\t) 



f(3-2(p t ) t ) 
f(l + 2(p t ) t ) 



if I < 0.5 
if I > 0.5 



(Pt)t~ 



(29) 
(30) 



Note that we have substituted for (h t )t in the expressions 
of Ref. £|, with the values of (h t )t that obtain for l c \ < 
I < l C 2 (iht)t = 0.75 for I < 0.5 and (h t )t = 0.25 for 
I > 0.5 0|). Thus, we can see that the expressions that 
we derived in Egs -ESt anu <|2*^|> are consistent with those 
obtained in Ref. 



E. Numerical Results 

In this section we present numerical data which sup- 
ports the analytical results that we presented in the pre- 
vious sections. Ref. investigated the behavior of 



(nj" 1 )t and a(nf 1 ,t) as a function of the resource level 
I. Further work was done with regard to the memory- 
less Genetic Model and general I by Burgos et al. in 
Refs. H 0- Figure^ recalls the results of Ref. 0]. 
We can clearly see the dynamic and frozen regimes for 
l c i < I < l C 2 and I < i c i, I > l C2 respectively. Figure 03 
shows (rif )t and a(nf ,t) in the memoryless variant. 
Figure^, shows (p t )t function of / in the original 

and memoryless models while Fig. 0Jd shows (h t )t in the 
original model. 

First of all note that in Fig. QJi (p t )t lies to a very good 
approximation on the following lines: 



memory: (p. 



l-l Z<0.5 



t/t : 



I 



Z >0.5 



no memory: (p t ) t —l 
(31) 



This confirms that the population of agents is capable of 
evolving to achieve the optimal values of p t given in Eqs. 
fTS) and PI). 
In Sec. Ill Dl we presented expressions in Eqs. <|25[1 . 

and a(nf 1 , t) If we substi- 



, J2U and for 



tute for (p t )t in these equations with the optimal values 
from Eq. (|16[) . neglecting the term A, we obtain the fol- 
lowing analytical equations for 
l c i < I < 1 C 2- 



and cr(nj , t) for 



-.,, = ^(1 + 20 



(32) 



In Fig. [3^ we show these analytic expressions together 
with the numerical data and opting 1 ] = Nl from Eq. 

We can see that (nf^t deviates from the optimal 
value of Nl for l c \ < I < 0.5 and 0.5 < I < l C 2 as pointed 
out in Ref. We now know, from Sec. Ill C 31 that 

the reason for this is that the population of agents can 
only control directly and not nf 1 . Thus their per- 
formance is reduced by the action of h t . We can see that 
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a) b) 




FIG. 3: a, Numerical results for (nf 1 )t and <j{n£ , t) as a function of the resource level I in the original Genetic Model. We also 
include lines which represent: optfn^ 1 " 1 ] = Nl from Eq. 111(1 and the analytical expressions for {nf 1 )t and cr(n^ 1 ,f) presented 
in Eq. (1321 . b, (nf 1 ) t and o^n+^i) as a function of the resource level I in the memoryless Genetic Model. 




FIG. 4: a, Numerical results for p t as a function of the resource level I in the original and memoryless models, b, Numerical 
results for (ht)t as a function of I in the original model. 



for l c i < I < l C 2 (n't }t instead lies on the line defined by 
Eq. H32|) H3- In Fig- we a ls° see that the numerical 
data for aiji^ 1 ,t) agrees to a good approximation with 
the expressions in Eq. 1|32|) . The analytic expression de- 
viates from the numerical data in the vicinity of I = 0.5. 
The reason for this is that when we derived the expres- 
sion for a(n+ 1 ,t) in Eq. 

we assumed that the agent 
gene value distribution is as given by Eq. I|17|) and so 
cr = 0. In fact it ^ and near I = 0.5 the a 2 term in Eq. 



(I27|l dominates. Therefore a(nf ,i) docs not go to zero 
as predicted. 

In Fig. 13b we see that, as predicted by Sec. Ill C 31 
(?i ( ^ 1 ) t in the memoryless model does lie on the optimal 
line defined by optfn^ 1 " 1 ] = Nl for l c \ < I < l c2 . Wc 
can also see that cr(n^ l " 1 ,t) in the memoryless model is 
much lower than in the original model. The large value 
of <j{nf ,t) in the original model results from the fact 
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FIG. 5: Numerical results for (n^ 1 ) t and a{nf 1 ,t) in the 
memoryless model using exogenous sources IZo.z, and IZi.o for 
/it. The results for (nf 1 ) t and ainf 1 ,t) in the original model, 
where h t is determined by the memory ({/it}t = 5), are in- 
cluded for comparison. Each pair of lines shows (n^ 1 )t and 
a(nf ,t) for {ht}t given as indicated. 



words taking {h t }t = Tti.o is exactly equivalent to the 
memoryless model that we considered in previous sec- 
tions. In Fig. the data produced using {h t }t = 7^o.5 
for (nf l ) t and cr(n^" 1 ,t) agrees with that from the orig- 
inal model for l c i < I < l C 2- For / < l c \ and I > l C 2 
the data from the original model switches to agree with 
that from the memoryless model with {h t }t = TZi.o cor- 
responding to the value of (h t )t from the original model 
in these regions. Note there is no need to consider IZ-0.5 
and TZ-i.Q. The lack of physical significance attached 
to the labeling of the states of ht means that the model 
behaves equivalently for {h t }t = 7l± a - 

These results confirm that the original Genetic Model 
and the memoryless Genetic Model with h t taken from 
an exogenous source can be regarded as equivalent when 
considering {nf l ) t - In contrast we shall see in Sec. Ill II 
that this does not apply when considering higher mo- 
ments. 



G. The values of (h t )t 



that n^ 1 is a function of both ri£ *, determined via IIi ;t 
by the distribution of agent gene values P{x), and h t . 
In the memoryless model nf l is a function of P(x) only 
which in equilibrium will be approximately constant in 
form. The small remaining fluctuations are due to the 
fact that the agent population does not achieve the ideal 
distribution of Eq. 1)17(1 . We can therefore say that the 
memoryless model is efficient in accessing the available 
resources. 



F. Generation of the prediction from an exogenous 
source 

In Sec. Ill Gl wc showed that the effect of the prediction 
ht is to reduce the agents' performance via its effect on 
n^ 1 . This being the case we should expect that the effect 
of the prediction on the model would be no different to 
that of an exogenous source provided that the value of 
(h t ) t is preserved. In this section we shall check this by 
comparing the behavior of the original model with a dif- 
ferent memoryless variant. In this variant the prediction 
h t will be generated by a random source, external to the 
model, rather than taking the value +1 V t. We shall 
let lZ a represent the output of such a random exogenous 
source which contains only the two values —1 and +1 and 
for which a is the time average, (lZ a )t = &. We represent 
the binary sequence generated by the memory for h t in 
the original Genetic Model by S. 

Figure shows numerical results for (nf 1 ) t in the 
memoryless model with h t given by the exogenous 
sources 72.0.5 an d Ti-i.o- The results for the original model, 
{ht}t = S, are included for comparison. The results for 
{ht}t = T^i.o duplicate those presented in Fig. since 
{ht\t — TZ-i.o is equivalent to h t = +1 V t. In other 



So far we have treated the values of (ht)t that obtain 
for l c i < I < 0.5 and 0.5 < I < l C 2 as values to be 
derived empirically by numerical simulation. Now we 
shall discuss the theoretical reasons for their observed 
values. 

Lo has presented a theory 01 that, predicts, using our 
conventions, the following values for (ht}t- 



-1 For I < l cA 



(fh)t = 



-i For l cl < I < 0.5 



-| For 0.5 <l < l c2 
-1 For l c2 < I 



(33) 



However numerical simulation robustly yields values of 
(h t )t ~ ±5 in the dynamic regime. In what follows we 
summarize Lo's analysis with the addition of some obser- 
vations which explain why the numerical and analytical 
results differ. Note that as we pointed out in Sec. Ill G II 
the absolute values that obtain for (h t )t in the dynamic 
regime are not important for the theory that we present 
here. As long as (ht)t < for l c \ < I < 0.5 and (h t ) t > 
for 0.5 < I < l C 2 everything that we have said about 
opt [pi ; t] will remain unchanged. Only the magnitude of 
the relative performance of the original and memoryless 
Genetic Models depends upon the values taken by (ht)t- 
The analysis hinges on the observation that: 



(A t )t = (ht)t 



(34) 



From Eq. [S]and the definition of IIi ;t the probability that 
At = +1 is given by: 



P[A t 



I 2 if ft,* = — 1 
h if ht = +1 



(35) 
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FIG. 6: Illustration of the summations I\ and I2 in Eq. (I39H 
needed to calculate (/it)t in the dynamic region of l c i < I < 
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FIG. 7: Numerical data for Ib ;t illustrating that (nf t )t 
deviates from the value of opt[nf ht ] predicted by Eq. 1151 . 
The numbers under the curve represent the sum of the values 
of the data points lying on either side of the cutoff indicated 
by the dashed line. Note that this cutoff is at nf ht = Nl if 
I > 0.5 and nf ht = N(l — I) otherwise. The model parameters 
were as given in Tab. EJexcept that n — 10000 and the value 
of nf ht was sampled over the interval 2000 < t < 102000. 



where: 



Nl N 

i=JV(l-0 



i=0 



Therefore, the ensemble average values of the global ac- 
tion At when h t = +1 and —1 are given by: 



(At) 



2I 2 - 1 If h t = -1 
2h - 1 Eh t = +1 



(37) 



The time average of the global action is given by the 
sum of the values of (At) for ht = ±1 weighted by the 



probabilities that h t = ±1. Thus: 

w ,= (i^)( 2 / I -i ) + (l-Mi)( 2 / 2 - 1 ) 

(38) 

Finally, using Eq. (|34|l gives: 

h+h-l 



(ht 



h-h + l 



(39) 



In Fig. Elwe illustrate the summations over 11^ in Eq. 
()36JI that define the values of l\ and I2 ■ For the purposes 
of calculating these values, we have taken the mean of 
the distribution to be given by /1 = opt[n^ l ~' lt ] from Eq. 
(I15fl and the standard deviation to be a = 8. We can see 
from Fig. El that the values for I\ and I2 will be given 
by: 



h = 



if I < 0.5 
\ if I > 0.5 



\ if I < 0.5 
1 if I > 0.5 



(40) 



since the summations either cover the entire peak of the 
Gaussian, exactly half of it or none at all. From Eqs. 
(|3*5|) and lf4T))) we obtain values for (h t ) t in the dynamic 
regime of: 



(ht)t = 



-| if Z < 0.5 
+ \ if Z > 0.5 



(41) 



which agree exactly with the values found by Lo in Rcf. 

The values given in Eq. (|40() depend on the assumption 
that (nf ht ) t is given by the optimal value of Eq. ltl"5|) . 
In fact, as we see in Fig. Q numerical simulation using a 
large number of agents reveals that this is not the case. 



In the figure opt[n 



+h t 



-h t \ 



0.002/V corresponding 



to opt[p t ] — (p t )t ~ 0.002. Careful observation of Fig. 0]i 
reveals that (p t )t always lies slightly closer to 0.5 than 
the value predicted by Eq. Hlfi|l in the original model. 
As we can see from Fig. [7] this changes the values of the 
summations in Eq. I|40|l as follows: 

_ fo if Z < 0.5 7 [0.323 if Z < 0.5 
1 ^ \o.677 if Z > 0.5 2 \l if Z > 0.5 ' 

(42) 

which by Eq. 1)39(1 yields the following values for (h t )t- 

[-0.511 if I < 0.5 

(ht)t = < (43) 



These figures agree much more closely with the numer- 
ically observed values of (h t )t = ±0.5 than the values 
obtained in Eq. gUJ). 

We can therefore see that the values of (h t )t = ±0.5 
that obtain in the dynamic regime result from the fact 
that p t deviates slightly from the optimal value of Eq. 
(I16fl . In the absence of this deviation we would indeed 
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obtain the values given in Eq. i|33|) for (h t )t- We can 
see that the memory does indeed have a non-trivial role 
to play in the Genetic Model. The feedback between the 
prediction ht and the global action A t represented by 
Eq. (|34|l controls the values of (ht)t that obtain in the 
dynamic regime l c \ <l < l C 2- 



H. Deviation of p t from its optimal value 

In Sec. ITTClwhen we calculated the optimal values of 
p t in Eq. (|16|) we implicitly assumed that the gene value 
distribution is given by (|17(1 and that therefore a = 0. 
If a is non-zero then the values of nf 1 will be spread 
about (n^" 1 ) t . An examination of Fig. |2Jreveals that the 
reduction in Lit resulting from a deviation of nf 1 towards 
is less than that of a deviation towards N. Therefore we 
might expect that the agents would evolve such that p t 
was less than opt[p t ] when a > in order to compensate 
for this. This argument also applies to the memoryless 
model. We know from Eqs. l(T3)). itTij) (JTSJl and Fig. El 
that the change that occurs in Lit at nf 1 = Nl in the 
memoryless model is greater than that which occurs in 
the original model. Therefore, if the argument presented 
above were correct, we would expect a larger deviation 
in the memoryless case. However, we can see in Fig. Q] 
that the deviation of p t from opt[p t ] in the memoryless 
model is less than that in the original model since (p t )t 
lies closer to the line described by p t = I in the figure. 
Figure [S] shows numerical data for H it in the memoryless 
model. It is the equivalent of Fig. [7| Any deviation of 
the peak of IT ;t in Fig. [S]from the optimal value given in 
Eq. (|20J) is too small to be observable. Thus we can see 
that the observed deviation opt\p t ]—p t in the memoryless 
model is much smaller than that in the original model. 
Therefore we conclude that a different effect, one most 
probably related to the variation of the prediction ht, 
must be responsible for the deviation observed in the 
original model. 

We note in passing that since (p t ) t deviates from 
opt[p t ] given by Eq. (fTTTf) in the original model while 
no such deviation is observed in the memoryless model, 
the gene value distribution of the agents in the two mod- 
els differs for general I. We would expect this difference 
to vanish in the case of / = 0.5, as found by Burgos and 
Ceva , and for the frozen regimes of I < l c \ and I > l C 2 
in which the original and memoryless models arc equiv- 
alent. 



I. Autocorrelation of nf 

In this section we consider another property of the Ge- 
netic Model: the autocorrelation of the time series nf 1 . 
First of all we shall present numerical data contrasting 
the behavior of the autocorrelation in the original and 
memoryless models. We shall then consider a Markovian 
analysis of the original Genetic Model which explains the 
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FIG. 8: Numerical data for Ih;t illustrating that the deviation 
op^n^ 1 ] — (nf 1 ) t in the memoryless model is much less than 
that in the original model with memory. The dashed line 
represents the value of nf above which A t = —1. The model 
parameters were as given in Tab. [I] with / = 0.6, except that 
n — 10000 and the value of nf ht was sampled over the interval 
2000 < t < 102000. 



behavior of the autocorrelation of nf 1 observed therein 



in the simplest case of m = 1. 



1. Numerical Results 

We define the autocorrelation CJ^Xt) of a time series Xt 
as follows: 



C T (x t ) 



(44) 



where r gives the lag time. 

Figure shows the autocorrelation of the attendance 
time series Cr^nf 1 ) for < r < 10 and 1 < m < 5. 
We can see from the figure that for I = 0.5 there is no 
significant correlation for r > 0. For / ^= 0.5, in con- 
trast, Crtn^ 1 ) as a function of r has a clear structure. 



The magnitude of C T 
where i = 1,2,3.... 



if 1 } is non-zero for 



(m + l)i, 



and 



i +1 



are anticorrelated for 



odd values of i and correlated for even values. Figure ITUI 
shows C T (nf 1 ) for the same values of r and I depicted 
in Fig. [9] However, this time h t was derived from a 
random exogenous source rather than the memory. It is 
clear from a comparison of Figs. Inland ITUI that the struc- 
ture observed in C T (nf 1 ^ in Fig. [§Jis only present in the 
original model with memory. Figure ITTI shows the auto- 
correlation of the prediction time series h t . From this we 
can see that the structure present in Fig. [5] derives from 
structure present in the prediction time series. 

From Figs. 1911 II we can see that one of the functions 
of the memory is to introduce non-zero autocorrelations 
into the prediction time series h t . These will clearly not 
be present in cither of the memoryless models in which 
either h t = +1 V t or h t = lZ a . In Ref. Burgos et al. 
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found that for I = 0.5 the gene value distribution func- 
tions in the Genetic Model and the memoryless Genetic 
Model are equivalent. Further in Ref. Q Burgos et al. 
stated that "a simplified version of the model, that makes 
no use of memory is indistinguishable from the original 
formulation" . In contrast however, the numerical data 
presented in Figs. 151 and ITU demonstrates that the two 
formulations are distinguishable. While the memoryless 
model yields equivalent results for (nf 1 ) t , it does not do 
so for higher moments. Any analysis in which possible 
autocorrelations in the n^ 1 time series could be a fac- 
tor could not be conducted using the memoryless model. 
However, this effect would not be noticeable in an inves- 
tigation restricted to the minority case of I = 0.5. 



P = 1 — a respectively. This information can be used to 
further simplify the state space. We need not consider 
states which have no inward transitions or states which 
only have inwards transitions from states that we have 
removed. This allows us to remove states 2, 8, 9, 10, 
12 or 13 from consideration. The model will not visit 
these states once initial transients have died away. This 
reduces our state space from sixteen states to only six: 
0, 1, 4, 5, 6 and 14. 

Figure [21 shows the state transition diagram corre- 
sponding to Fig. E| in the simplified state space de- 
scribed above. From this we can form the following 
Markov transition matrix, in which the remaining states 
are arranged in numerical order: 



2. Markovian analysis 

In this section we will present a Markovian analysis 
of the action of the memory in the Genetic Model. This 
analysis must be performed separately for each value of m 
of interest since each leads to a distinct state space. Here 
we shall present the analysis for m = 1 only. Treatment 
of higher values of m is possible, although cumbersome 
since they lead to state spaces which are too large to be 
treated conveniently by hand. 

The first stage of our analysis is to define a conven- 
tion for labeling the states of the memory. Each state 
label must define the values of ht, A t and the state of the 
memory. This is the minimum set of information needed 
to calculate the state transition probabilities. For m = 1 
the memory will contain 2 m = 2 entries corresponding 
to the two possible histories A t -\ = —1 and A t -i = +1. 
We shall label these entries m^ 1 and mf 1 respectively. 
Each state therefore comprises four attributes each of 
which can take values ±1. There are therefore sixteen 
possible states which we denote using the shorthand no- 
tation described in Fig. [T^b . We label these states by 
analogy with the binary number system as shown in Fig. 

For the sake of clarity we shall consider the case of 
I < 0.5. By symmetry the results that we derive will also 
apply to I > 0.5. From Eqs. (J2SJ) and (12} , we have the 
following expression for the probability that A t = +1: 



P[A t 



I 2 = Q if h t = — 1 
h = if h t = +1 



(45) 



where a represents the numerical value of I2 in Eq. (|42|l . 
The result of this is that if the prediction h t — +1 then 
the global action At = — 1 with probability P = 1. We 
can therefore discard states 3, 7, 11 and 15 from consid- 
eration. In each of these states h t = +1 and A t = +1 
and thus they will never be visited by the model. 

Using Eq. I|45[) we derive the state transitions shown 
in Fig. E| We can see that each state either makes a 
transition to one other state with probability P = 1 or 
to one of two possible states with probability P = a and 
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(46) 



which defines the stationary Markov chain. 

From Eq. [1B| we can derive the n-step autocorrelation 
functions C n (h t ). Recall the oscillatory dependence of 
C T (h t ) on r in Fig. [TTJi. If d{h t ) = and C 2 (h t ) = -e 
(where e is of order 0.1) then we would expect to observe 
the dependence depicted in the figure. If h t and h t +2 arc 
anticorrelated and \C 2 (ht) \ < 1 then we should expect 
ht and h t +4 to be correlated and \Ci{ht) \ < \C2(ht) | and 
so on. Thus the form of C T {ht) depicted in Fig. II lb for 
m = 1 depends only on the values of C\(ht) and C 2 (ht). 

First we calculate the stationary state s of the Markov 
chain defined by Eq. 0^1 This gives: 



( 1-a \ 

a 

a(l — a) 

^,2 



V 



Q 

,,.2 



(47) 



/ 



From Eq. (|44fl . the one-step autocorrelation Ci(h t ) is 
given by: 



Ci(h t 



{h t h t +i)t - (h t ) 



(h 2 t )t 



(h 



t/t 



(48) 



Therefore we need to calculate values for (/if)t, (ht)t and 
(htht+i)t- Since h t = ±1, (ftf)i = 1. In order to calculate 
values for the other two averages it is necessary to form 
the vectors h and hi which respectively give the values 
of h t and h t h t +i for each of the states in the simplified 
state space. The elements of ho can be read directly from 
Fig. UJl This gives: 



ho = -1 +1 -1 



-1 



(49) 
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FIG. 9: Autocorrelation of the nf 1 time series for the range of values of m shown. Model parameters: n — 1000, a) / = 0.4. 
b) I = 0.5. 
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FIG. 10: Autocorrelation of the nf 1 time series with ht given by a random exogenous source. Model parameters: n = 1000. 
a) I = 0.4, h t = 11-0.5. b) I = 0.5, h t = fto.o- 



Taking the scalar product of ho with s gives: 
(h t ) t = h .s = ° 



a + 1 



(50) 



In Sec. Ill Gl we used an alternative method, not restricted 
to m = 1, to derive the expression for (h t } t in Eq. 
Substituting I 1 = and I 2 = a in Eq. i ffity yields Eq. 
(15 OH above. Thus, the analysis presented here and in Sec. 
Ill Gl arc consistent. 



We can derive h x by multiplying the values of h t and 
ht+i in Fig. ED This gives: 



hi = (+1 -1 -1 +1 +1 -lj . (51) 
Once again taking the scalar product with s gives: 



(h t h t +i)t = hi.s = 



a- 1 
a + 1 



(52) 
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FIG. 11: Autocorrelation of the ht time series for the range of values of m shown. Model parameters: n = 1000, a) I = 0.4. 
b) I = 0.5. 
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FIG. 12: a) The labels used to index the sixteen possible 
states. The states that are crossed out are those in which 
h t = +1 and A t = +1. These can be omitted due to Eq. 
I45H . b) A key for the graphical representation of each state. 



Therefore, from Eqs. 1481 1501 and 1521 we have: 

d(ht) = . (53) 

Thus, we expect ht and ht+i to be uncorrelated which is 
in agreement with Fig. II lb . 



The two-step autocorrelation C2{ht) is given by: 

r (h) _ (hth t +2)t ~ (ht) 2 t , , 

Therefore in order to calculate the two-step autocorre- 
lation function C2{ht) we must calculate (h t h t +2)t- As 



FIG. 13: List of the one-step transitions that the model can 
make from each of the states listed in Fig. 1121 




FIG. 14: State transition diagram corresponding to the tran- 
sitions depicted in Fig. I13l in the simplified state space. The 
arrow labels give the transition probabilities. 
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FIG. 15: Example of how to derive the values of htht+2 for 
each of the states in the simplified state space. 



before we must form the vector h.2 corresponding to the 
values of h t h t +2 for each of the states in the simplified 
space. This can be done following the method described 
in Fig. 1 151 which gives: 



h 2 = (+1 -1 +1 -1 -1 -1 
Taking the scalar product with s gives: 



(h t h t +2)t = h 2 .s = 



(1 - 3a)(l + a) 



(55) 



(56) 



(57) 



(58) 



(1+a) 2 

From Eqs. I5UI 1541 and 1561 we have: 
C 2 (h t ) = -a. 
Therefore, from Eqs. 1531 and ISTl we have that: 
C T (h t ) = 1, 0, -a, 0, a 2 , 0, -a 3 , . . . 

for: 

t = 0,1,2,3,4,5,6,... 



Note that as a result of this the autocorrelation of h t 
at m = 1 gives a direct measurement of the value of a. 
This provides a much more convenient measure of a than 
computing the summation for I 2 in Eq. I|36|l . as was done 
in Sec. ITTTT1 

Figure^] shows a comparison of the theoretical values 
of Ci(h t ), obtained with the numerical value of a = 0.323 
from Sec. Ill 111 with the numerical values given in Fig. 
ITT1 We can see from Fig. ^|that Eq. (|58|l correctly pre- 
dicts the form of C T (h t ) although it docs underestimate 
the correlation for r = 4. One possible explanation of 
this slight deviation is that in the analysis above we have 
assumed that a is a constant. However, the value of a 
depends very sensitively on the form of IT :t . Therefore 
fluctuations in the gene value distribution of the agents 
could cause significant fluctuations in a. Markovian anal- 
ysis of the I = 0.5 case confirms that Cjjit) = for r > 1, 
in agreement with the numerical results of Fig. 1111 



FIG. 16: Comparison of the theoretical result of Eqs. 1531 and 
1571 with numerical results. The error bars show one standard 
deviation on the mean over an ensemble of five separate data 
sets. 



J. Summary of Memory Characteristics 

We conclude from the results presented in this section 
that the Genetic Model performs better in the absence of 
memory. By this we mean that the average total number 
of points scored per time step when h t = +1 V t is twice 
that when h t is determined by the global memory. This 
was established semi-analytically in Sec. Ill CI and sup- 
porting numerical data was presented in Sec. Ill El We 
showed in Sec. Ill C 31 that the reason for this reduction 
in the performance in the presence of memory is that the 
agents cannot directly control the distribution of values 



for 



, + 1 



as they can in the memoryless model, because 



n^ 1 is now also a function of ht- In Sec. Ill 11 we pre- 
sented numerical data to show that the values of (n^ l " 1 )t 
and a{nf ,£) can be reproduced if the prediction h t is 
taken from an exogenous source providing that the value 
of (h t )t is preserved. Thus, the feedback between the 
global action of the agents and h t is of no benefit to the 
population of agents. The only function of this feedback 
is to regulate the value of (h t )t- 

We also investigated the values observed for the time 
average of the prediction (ht)t as a function of the re- 
source level I. We showed that the values of (h t )t of ±0.5 
that obtain in the dynamic regime of l c \ < I < l c i are 
due to the deviation observed between the mean of the 
agent gene value distribution P(x) and the optimal value 
predicted by Eq. (|2Ujl . 

In Sec. Ill II we showed that the form of the autocor- 
relation function of the nf 1 time series at I ^ 0.5 oc- 
curs as a result of the cycles in state space performed by 
the memory Finally we demonstrated that the two-step 
autocorrelation of the prediction £2(^4) can be used to 
provide a direct measurement of the deviation described 
above. 



III. SELF-INDUCED SHOCKS IN THE GRAND 
CANONICAL GENETIC MODEL 

In this section we move on to consider the important 
practical property of self-induce shocks, otherwise known 
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as endogenous large changes (ELC). Such large changes 
are arguably a defining characteristic of complex systems, 
yet there is no rigorous quantitative description of such 
events in real-world realizations of complex s yste ms (for 
examples see Refs. [H E H3 El E El ) • 



Introduction 




As we show here, the Genetic Model can be general- 
ized in a straightforward way to produce a model system 
which demonstrates such large changes. The extent to 
which these large changes are insensitive to the memory, 
then provides a useful tool for analyzing the microscopic 
causes underlying these large changes. In particular, we 
introduce an extension of the Genetic Model in Sec. [HJ 
in which the number (or 'volume') of active agents is a 
time dependent quantity. We shall refer to this variant 
of the Genetic Model as the Grand Canonical Genetic 
Model (GCGM). 

One particular application might be to financial mar- 
kets, where large changes are called crashes or draw- 
downs. However the Genetic Model does not directly 
yield a price time series. Therefore, if we want to con- 
sider the effect of endogenous large changes on price, we 
must derive one from fundamental observables such as 



,±i 



By definition the threshold value of n 



Nl cor- 



responds to the state in which the volume of the item that 
is being traded which is available for sale is equal to the 
demand. Therefore, the more general case of I ^ 0.5 rep- 
resents a system in which the volumes in which an item 
are bought and sold are not equal. The excess demand 
is then given by: 



A = 



I 



1 - I 



buy 



sell 



(59) 



If we let the action of an agent a^t = — 1, +1 represent 
choosing to buy or sell respectively then we obtain the 
following expression for the price 7r t+ i at time t + 1 in 
terms of the price at t: 



7Tt+l 



7T t 



/ 



l-l 



, + 1 



(60) 



where A is known as the market depth and determines 
the magintude of the change in price caused by a unit 
change in A. Different expressions for the price 7r t in 
terms of the excess demand A have been discussed in the 
literature (see, for example, Refs. [2i| and (24[). The 
linear expression in Eq. 1)60(1 represents the simplest of 
these and is not as realistic as expressions in higher pow- 
ers of A. Nevertheless it is more than adequate for the 
illustrative purposes for which we shall need it. Since in 
all that follows the units of the price 7r t are arbitrary we 
will take A = 1. 

An example of a large volume change observed in the 
GCGM is given in Fig. El We can see that large changes 
occur in the volume accompanied by large price move- 
ments. However, the behavior of the volume in the figure 



FIG. 17: Example of an endogenous large change of the 
volume in the GCGM with accompanying price time series. 
Model Parameters: n — 501, m = 3, r = 0.2, I = 0.5 and 
T = 12. 



is qualitatively different to that observed in the GCMG: 
Ref. |2l| reported two distinct types of behavior. For 
traders with a long memory the volume was observed to 
be continuously fluctuating with occasional particularly 
large fluctuations which were not instantaneous; much 
like the 'drawdowns' and 'drawups' discussed by Sornette 
in Ref. pfij . For traders with short memories the volume 
was frequently zero with occasional large instantaneous 
spikes and corresponding instantaneous price movements. 
In contrast, in Fig. El the volume exhibits small fluc- 
tuations and occasional instantaneous changes which are 
accompanied by periods of large fluctuations in the price. 
We shall see that the behavior that we observe in these 
figures is typical of the behavior of the volume in the 
GCGM. 



B. The Grand Canonical Genetic Model 

The Genetic Model represents an abstract model of a 
population competing for a limited resource and as such 
it often discussed in the context of financial markets 0, 
H H 03- 13 [Hill In such a context, however, it does 
not seem realistic that the agents trade at every time 
step. A real market trader would also have the option of 
withdrawing from the market and returning when s/he 
felt confident of a successful outcome. In order to model 
this extra degree of freedom, and in keeping with the 
work of Johnson ct al. in Ref. [2l|, we shall introduce an 
extension of the Genetic Model in which the agents are 
free to opt in and out of the game. 

As in the original Genetic Model described in Sec. [H] 
and Ref. Q there are N agents participating in the 
model. However, unlike the original model they do not 
all play at each time step. At any time t there will be 
two populations of agents, an active population and an 
inactive one. Active agents participate in exactly the 
same way as they do in the original model. In contrast 
an inactive agent i will continue to make its choice as 
if it were participating, however it will not be consid- 
ered when calculating the global action and its score will 
not be updated. We can imagine that inactive agents 
represent traders who do not make a trade at time t. In- 
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stead they make a prediction of whether their best choice 
would have been to follow the prediction a i:t = +h t or 
to refute it a, ; t = — ht- Since such a trade is virtual in 
that the trader does not act on it, it has no effect on 
the market and the trader is protected from losing, or 
even making, money. The only effect of such a virtual 
trade is that the agent reevaluates its confidence level 
depending on whether the trade would have been suc- 
cessful or not. These virtual trades are analogous to the 
virtual points earned by strategies in the Minority Game 
. They allow the agents to keep track of their potential 
performance without in any way representing an agent's 
wealth. 

These modifications necessitate some minor changes to 
the basic equations of the Genetic Model that we intro- 
duced in Sec. |nj Let the activity of an agent i be given 
by Zi- t . If Zi-t = then agent i belongs to the population 
of inactive agents and vice- versa for Zi-t = 1. Equation 
JTJ for the global action A t now becomes: 

. I +1 lit < ln t . v-^ /r, \ 

A t = < ,, where: n t = > z i:t ■ (61) 

1—1 nj > ln t . 

Inactive agents become active and active agents become 
inactive according to their performance in the recent 
past. We defined a new model parameter known as the 
confidence interval, T. An inactive agent will become ac- 
tive if it would have won for T consecutive time steps. In 
other words, an agent i for which Zi-t = will activate, 
z i;t+i = 1) h" o-i-.T = +A T for t — T < t < t. In the same 
way, an active agent will become inactive if it loses for T 
consecutive time steps. In order to control the activation 
and deactivation of agents we assign each agent a new 
quantity with we shall call its virtual points v^t, in keep- 
ing with the virtual points allocated to strategies in the 
Minority Game. For an active agent Vi-t is increased each 
time the agent loses and is reset to zero if it wins. Thus, 
for active agents Vi-t is the number of consecutive time 
steps for which the agent has lost. For an inactive agent 
Vi-t is increased each time that the agent would have won 
and is reset to zero each time it would have lost. The 
updating rules for Vi-t can be summarized as follows. 
If Zi-t = 0: 



If Zi* = 1: 



Vi-t+l 





Vi-t + 1 



if di-t = -A t 
if ai-t = +A L 



(62) 



Zi-t+l 







if Vi-t < T 
if Vi-t = T 



(65) 



The expression in Eq. @ for the number of agents for 
which a,;* = +1 becomes: 



4 1 = o ^2 z i;t (o»;t + 1) 



(66) 



In this section we shall define p t to be the mean gene 
value of the active agent population. Thus, p t is given 
by: 



Pt = — y2 zi-tPi-t 



(67) 



Therefore, the expression in Eq. @ for the mean number 
of agents following the prediction ht becomes: 



(68) 



l t h *) = ^2^;tPi-t 
i 

= n tp t 

Price Time-Series 



As we stated in Sec. IIII Al the cutoff, now given by 
nf = n t l, in the Genetic Model, defined by Eq. I|61|l. 
is by definition the state in which the excess demand 
A = 0. From Eq. H6U|) we can see that the price change 
An = nt — TTt—i is positive and negative for nf 1 < n t l 
and nf 1 > n t l respectively. By comparison with Eq. 1)61(1 
we can see that the condition that determines the sign 
of the price change at time t is the same as that which 
determines the global action A t ; with the exception that 
the equality in Eq. (|61|l gives a price change of zero. 
With reference to Eq. JHJl we have the condition that 
Att > in terms of nf ht : 



H 

,+h 



> th(1 - I) 



if h t = -1 
if h t = +1 



(69) 



If 



From this it follows that the probability P[Air > 0] that 
the price rises or remains the same at time t is given by: 



Vi-t+l 



Vi-t + 1 if at.t = -A t 







if ai-t = +A, 



(63) 



P[Att > 0] = 



h 



if h t = -I 
if h t = +1 



(70) 



The rules for agent activation and deactivation are then, 
If Zi-t = 0: 



Zi-t+l 







if Vi-t < T 
if Vi-t = T 



(64) 



where 1\ and I2 represent the summations defined by 
Eq. (|36fl and illustrated by Fig. HJ1 with the substitution 
N -> n t . We can see from Eq. J7D|) and Fig. that 
for I < 0.5 P[Att > 0] = for h t = +1 and for I > 0.5 
P[Air > 0] = 1 for h t = -1. The result of this is that, in 
the dynamic regime of the original GCGM with memory, 
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one of the values that ht can take will cause the price to 
rise or fall with probability P = 1. Note that in the cases 
of I < 0.5, h t = —1 and I > 0.5, h t = +1 we do not expect 
that P[Att > 0] = 0.5 as might be expected from Fig. [5] 
Recall that, as we discussed in Sec. Ill Gl p t deviates from 
the optimum values given in Eq. (|16[) in the Genetic 
Model. Furthermore, because the GCGM is frequently 
perturbed by ELC it does not settle into equilibrium in 
the same way as the Genetic Model and so p t is more 
variable although, as we shall see, it does remain close 
to the values given by Eq. ltT^|l in the periods between 
ELC. 



D. Endogenous Large Changes (ELC) in the 
GCGM 

As we shall see the behavior of the model in this regime 
is rich and complex. For this reason we shall initially 
consider a simplified memoryless variant of the GCGM 
which is analogous to the memoryless Genetic Model con- 
sidered in Sec. [H] Subsequently we shall consider how 
the inclusion of memory affects the behavior of the full 
GCGM. 

ELC in the GCGM result from a combination of two 
factors, both of which must be present if a large change 
is to occur. We shall see later that the capacity of the 
model to undergo an ELC depends sensitively on the dis- 
tribution of gene values P{p). However, a suitable P(p) 
is not a sufficient condition for an ELC to occur. It is 
also necessary for a particular pattern to occur in the 
global action time scries. We can think of this pattern 
as a trigger that initiates the ELC, but only if P(p) is 
in a susceptible state. Furthermore we shall see that the 
natural evolution of the model causes P(p) to evolve to- 
wards the most susceptible state while ELCs move P(p) 
towards the state in which it is least susceptible. Thus, 
rather than settling into equilibrium like the original Ge- 
netic Model, the evolution of the GCGM is characterized 
by a cyclic behavior: P(p) periodically evolving towards 
a more susceptible state until its progress is reversed by 
an ELC. 



1. Susceptibility of P(p) 

ELC like the one illustrated in Fig. El are the re- 
sult of highly correlated behavior of the agents. By this 
we mean that a significant number of agents activate or 
deactivate at the same time step. This implies highly 
correlated behavior since in order to do so the actions 
of all of the agents involved must be identical for the T 
preceding time steps. It is initially surprising that such a 
high degree of correlation could arise in the GCGM be- 
cause, unlike Minority Game agents, GCGM agents make 
their decisions stochastically. The probability of coinci- 
dence between the actions of a large group of agents will 
usually be very small. However, there are two groups of 



agents in the model whose behavior is well correlated. 
These two groups are those agents whose gene values lie 
within a certain small range S of and 1. We shall call 
agents belonging to these groups zero agents and one 
agents respectively. The degree of correlation of these 
agents is a decreasing function of 5, being a maximum 
for (5 = 0. It is easy to show that if we set an upper limit 
on the fraction fd of zero and one agents whose actions 
are not perfectly correlated over a period of T time steps, 
5 is given by: 

5 = 1 - (1 - f d )± (71) 

The precise value chosen for S is not important since it 
only serves to give a measure of the population of zero 
and one agents. Therefore it is more convenient to choose 
a fixed value for S which gives rise to values of fd which 
lie within an acceptable range, rather than choosing a 
different S for each value of T. In all that follows we 
shall take S = 0.02 which gives f d < 0.33 for T < 20. 

In short, we see that the probability of highly corre- 
lated agent behavior increases rapidly as the number of 
zero and one agents increases. Thus, gene values distri- 
bution functions P(p) which are biased to favor agents 
with gene values near to p = 0.0 and p = 1.0 will be the 
most susceptible to ELC. 

Now that we've considered what forms of P(p) are 
most susceptible we will discuss how P(p) evolves in the 
GCGM. In the periods between the ELC the number of 
active agents rit given by Eq. I|61|) is a slowly varying 
function of time. As we remarked above, correlations be- 
tween the behavior of large numbers agents are expected 
to be rare and so n t will fluctuate slowly with time as 
individual agents activate and deactivate. Hence the re- 
sults of Refs. d ITlcan be applied to the GCGM in 
these periods. RefsT^, 0| describe how in the Genetic 
Model the agents self- segregate into two populations hav- 
ing low and high gene values - these two populations can 
be thought of as a 'Crowd' and 'Anticrowd'. Thus, al- 
though we have yet to consider what the effect of an ELC 
will be on P(p) we can see that after such an event P (p) 
will evolve continuously towards the extremiscd distribu- 
tion described by Ref. From our discussion above 
we know that it is this type of extremised gene value 
distribution which is most susceptible to ELC. 

2. Triggers in the global action time series 

We saw above that it is only the zero and one agents 
which can participate in the highly correlated behavior 
necessary for an ELC. Therefore, in order to think about 
what patterns in the global action time series might in- 
duce an ELC it is necessary to consider these agents. If 
at time step t, A t = +ht then each zero agent will lose 
while each one agent will win. If A t = +h t for T con- 
secutive time steps then immediately following the Tth 
time step a fraction 1 — fd of the active zero agents will 
deactivate while the same fraction of the inactive one 
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agents will activate. Similarly if A t = —ht for T time 
steps then a fraction 1 — of the inactive zero agents 
will activate and a fraction 1 — fd of the active one agents 
will deactivate. Thus we can see that sequences of time 
steps in which A t = +h t or A t = —ht for T time steps 
will be important for the correlated agent activations and 
deactivations that make up an ELC. 

Since such sequences are important in the occurrence 
of ELC it would be useful to have an expression for the 
probability that they will occur. The first step is to derive 
expressions for the probability that At — ±/it- We shall 
see later that these expressions are important in their 
own right. From Eq. I|35|). substituting n t for N, we 
have the following expressions for the probability that 
A t = ±h t : 



Finally we can express Eq. i|75|) in terms of erf func- 
tions as, for h t = ±1: 



P[A t 



P[A t = +h 




(76) 



where for convenience we have defined A? as follows: 



X7=p t -(i-i) 

The erf function is defined as: 



Xt=p t -l. (77) 



erf(x) = —= 

\ /7T 



dt 



(78) 



E n, ;t if ftt = — i 

P[A t = -h t ] = { l = {1 n - l)nt 

E n i;t if h t = +i 



P[A t 



i — Int + 1 
(l-Z)nt-l 

E n l;t if h t = -i 

i = 
hit 

E n i;t ifftt = +i 

i = 



(72) 



We shall see later that n t ^> 1. Therefore we can use the 
continuous approximation of II t (x) for IL ;t . As demon- 
strated in Ref. ^Ii;t wn l be a Gaussian. In the 
GCGM only active agents contribute to the mean fx and 
variance cr 2 of U t (x). Therefore: 



1 l f-M' 



aV2ir 



(73) 



where: 



H = n t Pt 



v 2 = X] z i;tPi;t(l - Pi;t) ■ (74) 



In the continuous approximation we have: 



P[A t = 



P[A t 




+ 00 

/ U t {x)dx if ht = -1 

(1 - l)n, - i 

/ II t (x) dx if h t = +1 

ln t + i 
(l-i)n t -i 

/ IL t {x)dx if h t = -1 

— oo 

/ U t {x)dx if h t = +1 



(75) 



where we have used the fact that since a is of order unity 
the integrands in Eq. (|75|) will approximately vanish for 
x < and x > N. This has allowed us to replace lower 
limits of with — oo and upper limits of N with +oo. 



We can see from Eq. Ij76(l that the expressions for the 
probability that A t = iht depend upon the the values 
taken by h t . For this reason it will not be possible in 
general to derive a simple expression for the probability 
that At = ±/i t for T consecutive time steps. Such an 
expression would depend upon the realization of the pre- 
diction time series during the specific T time steps under 
consideration. In the next section, however, we shall sec 
that a simple expression can be derived in the case of the 
memoryless model. 

Ref. presented an analysis of crashes in financial 
markets such as the one that occurred on the NASDAQ 
in April 2000. The authors propose that such crashes 
result from speculative bubbles in which large numbers 
of traders share the same unrealistic expectations of the 
future performance of the companies in question. These 
bubbles eventually burst, apparently in response to some 
event which acts as a trigger. We can draw a broad qual- 
itative analogy between this and the GCGM. The ex- 
trcmised gene value distribution discussed in Sec. IIII D II 
corresponds to a state of the model in which large num- 
bers of agents share the same unrealistic expectation that 
the global action will be equal to or the opposite of the 
prediction. It is this speculative distribution that is most 
susceptible to triggers that occur from time to time in 
the global action time series. 



E. ELC in the memoryless GCGM 

We form the memoryless GCGM from the full model 
described in Sec. IIII Bl by taking ht = +1 V t in exactly 
the same way as we did in Sec. [H] Our discussion of 
the susceptibility of the gene value distribution function 
P{p) in Sec. IIII D II applies equally to the memoryless 
and the full GCGM. The equivalent of the patterns of T 
time steps in which A t = —h t or A t = +h t are those in 
which At = — 1 or At = +1. In the memoryless case, the 
expressions in Eq. (|76|l become: 



P[A t = ±1] 



1 ± erf - 



ntXl 



oV2 



(79) 
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This expression is no longer dependent on ht, since 
h t = +1 V t. Therefore, in the case of the memoryless 
GCGM, we can derive the following simple expression for 
the probability A t that At = ±1 for T consecutive time 
steps: 



A, 



1 




(80) 



Figure ITBk shows A t given by Eq. (|80() as a function of 
A^. However, it is easier to interpret a plot of the average 
waiting time, given by A^" 1 . A plot of this is included in 
Fig. HHb. 

There are a couple of points to notice in Fig. First 
of all the minimum of A t does not occur at Xf = but 
at a value of A, + = -J— . This results from the fact that in 

t 2n t 

the case of nf l ~ Int our model tie breaks by declaring 
the global action At = +1. The most obvious feature of 
Fig. ^]is that A t increases rapidly with increasing Xf . 
This means that the probability of a trigger sequence 
occurring in the global action time series At increases 
with the deviation of p t from I. Therefore we can consider 
the value of Xf to be controlling the probability that a 
trigger sequence will occur. Furthermore, note that A t 
never vanishes and so the average waiting time never goes 
to infinity. Crucially, this means that regardless of the 
value of p t there is always a non-zero probability of that 
a trigger sequence will occur. 



1. Price Time-Series in the memoryless GCGM 

One of the advantages of the memoryless GCGM is 
that the dynamics of the price time series are particularly 
simple. We saw in Sec. IIII CI that the probability of a 
price fall at time t is given by the same condition that 
the global action A t = -1. Therefore, P[An < 0] — 
P[A t = -1] given by Eq. lO. Tnus tnc P ricc wiu fal1 
with probability P < 0.5 if < and with probability 
P > 0.5 if Xf > 0. 



2. Example ELC 

Now that we have introduced the memoryless GCGM 
we shall consider a specific example of an ELC. This will 
allow us to see how the elements discussed in Sec. IIII Dl 
are involved in ELC in the GCGM. In order to do this, 
however, we need to introduce one further quantity which 
we shall call the prediction performance and denote by 
r)t- At time step t the value of % gives the number of 
previous consecutive time steps at which the prediction 
was the same as the global action A t — +h t . In the 
memoryless GCGM the value of r\t gives the number of 
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FIG. 18: a) The probability A t (defined by Eq. J80J) that 
A t = ±1 for T consecutive time steps as a function of Xf 
defined by Eq. 17711 . b) The average waiting time A^ 1 for 
such a pattern in the global action time series At to occur. 
We have taken m = 205 and at = 4.9 which correspond to 
their mean values over the period described by Figs. 1191 and 
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consecutive time steps preceding t at which A t = +1. 
The reason that r\ t is useful is that during one of the so 
called trigger sequences that we discussed in Sec. IIII D 21 
■q t will become large and, thus, it can be used to identify 
these events. 

Figure IT§1 lists the values of the global action A t and 
the prediction performance r] t for a period of time in 
which an ELC occurs in the memoryless GCGM with 
I = 0.4 and T = 12. In order to show the behavior of 
the one and zero agents that we discussed in Sec. 1111 D II 
we have also included the virtual points Vi-t of an agent 
i which is inactive and has pi-t = 1.0 at the beginning 
of the time period shown. Since the actions of zero and 
one agents are anti-correlated (as we saw in Sec. 1111 D If) 
the virtual points of inactive one agents and active zero 
agents will always be the same. The same applies to 
the virtual points of active one and inactive zero agents. 
For this reason it is only necessary to give v^ t for one of 
these four groups in Fig. IIII D II since from this we can 
infer the virtual points of the others. Figure l2Uk shows 
rjt graphically for the same time period as well as the 
probability P[A t = — 1] that the global action At = — 1 
at each time step, given in terms of Xf by Eq. 
Figure l2Ub shows the deviation of p t from Z, A^, over 
the same time period. The quantities depicted in Figs. 
HT?! and |20| arc those that play an important role in the 
mechanism that causes ELC. Later, in Fig. we shall 
demonstrate what effect the ELC described here has on 
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external observables such as the price and the volume. 
In the following paragraphs we describe the significant 
features in Figs. EH an d [201 The paragraph labels a- 
e correspond to the identically labeled time intervals in 

Fig. m 

a This sequence of T time steps in which A t = +1 
provides the trigger sequence discussed in Sec. IIII D 21 At 
each time step during this period the virtual points v^t of 
the inactive one agents and active zero agents increases. 
When the model reaches the final time step in this period 
Vi-,t = T = 12 for both inactive one agents and active zero 
agents. The inactive one agents will then activate while 
the active zero agents will deactivate. The virtual points 
of these agents will then be reset to 0. Unless the numbers 
of one agents activating and zero agents deactivating are 
approximately equal, this correlated behavior will lead to 
a step change in the volume like the one depicted in Fig. 
ITTI Only active agents contribute to p t (see Eq. I[67|l) 
and so this instantaneous loss of zero agents and gain of 
one agents causes p t , and therefore Xf , to undergo a step 
increase. We can see this clearly in Fig. |201 

b Throughout the period of T time steps labeled b 
in Fig. |HA+ « 0.16. Equation (O gives P[A t = -1] in 
terms of A^ and this is depicted graphically in Fig. 1211 
where we have taken values for nt and o~t corresponding 
to their mean values over the period described by Figs. 
[H and HO] We can see from Fig. |2T] that for A+ < 
-0.05 and A t + > 0.05 P[A t = -1] « and 1 respectively. 
Therefore, if the magnitude of Xf exceeds 0.05 the model 
becomes quasi-deterministic at time step t. Thus, for 
the period labeled b, A t = — 1. The effect of this on 
the zero and one agents is exactly the opposite of that 
of period a; the virtual points of the inactive zero agents 
and the active one agents now increases at each time step. 
Once again v^ t = T = 12 for both these populations 
at the end of period b and so the inactive zero agents 
activate while the active one agents deactivate. Note 
that the inactive zero agents activating at the end of 
period b are not just those that deactivated at the end of 
period a. Any zero agents that were inactive at the start 
of period a would have been unaffected by the trigger 
sequence, however they now activate along with those 
that previously deactivated. The result of this is that 
Xf does not return to the value of Xf ss 0.4 that it had 
at the end of period a. That would have returned the 
model to its non-deterministic state. Instead, however, 
Xf f» —0.06 and so at the start of period c P[A t = —1] = 
0. 

c This period of T time steps in which A t = +1 is 
identical to period a in terms of its effect on the one 
and zero agents. Thus, at the end of period c the in- 
active one agents reactivate and the active zero deac- 
tivate resulting in the same step increase in A^ that 
we observed before. The important difference between 
periods a and c is that the first occurred stochasti- 
cally (there was a significant non-zero probability that 
At = ±1 at each time step) whereas period c occurs 
quasi-deterministically (the probability that A t = +1 for 



the T time steps in period c is P w 1). 

d Similarly period d is identical to period b except 
that at the end A^ = —0.03. For Fig. [21 we can see that 
this gives P[A t = -1] > 0. 

e P[A t = — 1] is no longer sw and so the model re- 
turns to its usual stochastic behavior, A t taking values 
— 1 and +1 probabilistically. This represents the end of 
the ELC since there is now no mechanism for the syn- 
chronized activation and deactivation that occurs during 
periods a-e. The model now returns slowly to the equi- 
librium state. In other words the state that it is in once 
transients due to any ELC have died away. 

From the analysis that we have presented above we 
might have expected that the periodic synchronized acti- 
vations and deactivations that we described above would 
continue indefinitely and that the model would never re- 
turn to the stochastic state. One question that we did 
not address above, however, is that of how the model 
manages to break out of the deterministic behavior that 
it exhibits during the ELC. We have considered the effect 
of a period of T time steps in which A t = ±1 on the zero 
and one agents in terms of agent activation and deactiva- 
tion. However, we have not considered agent mutation. 
We shall see in what follows that it is agent mutation 
that allows the model to return to the stochastic state. 
During periods b-d the one agents are only active at 
time steps at which At = — 1 and the zero agents are 
only active when A t = +1. Because of this the scores 
of zero and one agents are decreasing functions of time. 
Their scores are fixed when they are inactive and when 
they are active their individual actions are the inverse of 
the global action: a^t = —A t . While these agents are 
inactive the model is favorable to them and so after T 
time steps they reactive. However, because the behavior 
of all these agents is so highly correlated, in doing so they 
change the dynamic of the model so that it is no longer 
favorable. This has a clear analogy with the phenomenon 
of market impact in economic systems. 

Since the scores of the zero and one agents are de- 
creasing functions of time during the ELC the scores of 
these agents will rapidly reach the death score Si- t = —D 
at which they mutate. If r 3> S = 0.02 (defined by Eq. 
171( 1 then with a very high probability of a mutating 
zero or one agent will mutate to a gene value of pi-t > 5 
or pi- t < 1 — S respectively. The result of this is that 
the population of zero and one agents that participate in 
the synchronized activations and deactivations steadily 
decreases throughout the ELC. Eventually there are no 
longer enough of these agents to maintain A^ at a mag- 
nitude greater than 0.05 and so P[A t = -1] ^ or 1. 
The model then returns to the stochastic state. 



3. Summary of ELC in the memoryless GCGM 

In this section we shall bring together the elements 
that we have introduced so far in order to give a broad 
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FIG. 19: The prediction performance rjt, At and the virtual points v^t of an agent i which initially is inactive and has a gene 
values of pi- t = 1.0. The model is the memoryless GCGM with n = 500, / = 0.4, T = 12, r = 0.2 and m — 3. 
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FIG. 20: Various important quantities plotted over the time 
period corresponding to Fig. 1191 a) The prediction perfor- 
mance r\ t (measured against the left hand axis) and the prob- 
ability P[At = — 1] that the global action is —1 (measured 
against the right hand axis), b) The deviation of p t from I, 



overview of ELC in the memoryless model. We demon- 
strated in Sec. IIII D II that in, what we now call, the 
stochastic state the agents migrate towards gene values 
p = 0.0 and p = 1.0. Therefore, while the model is in the 
stochastic state the number of zero and one agents in- 
creases. This increases the susceptibility of the gene value 
distribution P{p) to any trigger sequences that might oc- 
cur and also increases the duration of the next ELC. If 
a trigger sequence occurs in the evolution of the model 
then if there are enough zero and one agents an ELC 
will take place as described in the previous section. One 
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FIG. 21: Probability P[A t = -1] that the global action 
At = —1 in terms of \f given by Eq. 1791 with n t = 205 and 
a = 4.9. These values correspond to the mean values of these 
quantities over the time period depicted in Figs. 1191 and 12*01 



of the effects of the ELC is to reduce the numbers of 
zero and one agents. This decreases the probability of a 
subsequent ELC occurring. 

In order to make this clear we show in Fig. 122b the 
numbers of zero and one agents during the time period 
leading up to and after that depicted in Figs. H9landl2Cjl 
Figure l2*2*k shows over the same time period. As we 
saw in Sec. IIII E 21 ELC can be identified by the spikes 
that occur in the time series. We can see in Fig. "22 
that, while the ELC that we examined in Sec. IIII E 21 
causes a large decrease in the numbers of zero and one 
agents, significant numbers remain and so the initial ELC 
is followed by several smaller ones. The most important 
feature of Fig. |2U to notice, however, is that sudden 
decreases in the numbers of zero and one agents that 
occur at each ELC and the steady increase that these 
quantities exhibit in the periods between ELC. 
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FIG. 22: a) The evolution of the numbers of agents for which 
Pi-t < 8 (zero agents) and pi-t > 1 — S (one agents) over a 
time period that includes that depicted in Figs. 1191 and 1201 
b) A^ over the same time period. The occurrence of ELC 
is indicated by spikes in the A^ time series. The first ELC 
identifiable is that depicted in Figs. 1191 and 

So far, in order to understand the mechanism that 
leads to ELC, we have concentrated on quantities which 
are internal to the model. In the economic analogy these 
would correspond to quantities whose values would be 
extremely hard to quantify. For example the confidence 
of, or strategies adopted by traders. However, as we 
remarked in Sec. IIII Al ELC in the GCGM also affect 
quantities that are directly observable and quantifiable 
such as the volume and the price. We have plotted in 
Fig. 1231 the volume and the price over the same time 
period described by Figs. 1191 and |2*U1 We can see from 
Fig. [5j|]that the oscillatory activation and deactivation of 
the zero and one agents that we described in Sec. IIII E 21 
leads to corresponding oscillations in the volume. 

We saw in Sec. IIII E II that the probability that the 
price falls at time t P[A-7r t < 0] is equal to P[A t = — 1] 
which is in turn given in terms of Xf by Eq. (|79|l . Thus, 
we can see from the plot of P[A t = — 1] in Fig. 120b that 
the oscillations in Xf will give rise to alternate periods in 
which the price rises and falls as we see in Fig. 123b . In 
terms of A^~ the ensemble average excess demand is given 
by: 

<A> = . (81) 

Thus we can see that the magnitude of the price changes 
at each time step is proportional to n t |A^~|. Therefore, 
the overall fall in price depicted in Fig. 123b is due to the 
fact that the magnitude of the positive excursions of Aj~ 
during the ELC exceeds that of the negative excursions. 

4- Another example ELC 

In Figs. [2] and [55] we give a second example of an 
ELC in the memoryless GCGM. We can see that the be- 



FIG. 23: Evolution of the number of active agents (also know 
as the volume) and the price (defined by Eq. over the 

same time period described by Figs. 1191 and I2TJ1 

havior of the model depicted therein is much the same 
as that exhibited in Figs. EH an( l EOI The period la- 
beled a provides the trigger sequence of T time steps for 
which At = +1. As before Xf exhibits decaying step- 
like oscillations during periods b,c and e until the model 
returns to dynamic equilibrium at the end of e. Note, 
however that the periods c and e, for which A t — — 1 
and At = +1 respectively, are separated by three time 
steps, d, in which A t = — 1, — 1,+1. Figure |21 demon- 
strates why this occurs. We can see from the figure that 
during periods d and e Xf = 0.552 and that this is not 
sufficiently large to yield P[A t = —1] = 1.0. Because of 
this period e, in which At = —1, does not occur deter- 
ministically. However, since P[A t = — 1] = 0.967 it is 
more probable that At = — 1 for T time steps than when 
the system is in dynamic equilibrium. From Fig. 1241 we 
see that at the last time step of period d At = +1 be- 
cause P[A t = —1] 7^ 1.0 however T time steps then follow 
(period e) in which A t = — 1. Thus the model is fairly 
robust against stochastic fluctuations. It's not important 
if there is a period like d in which the oscillatory pattern 
is briefly broken. During this interjected period, how- 
ever, agent mutation will act to bring Xf closer to 0.0. 
Therefore the longer the interjected period the lower the 
probability that the ELC will continue. 

F. ELC in the full GCGM: an idealized case 

Before we examine some examples of numerically ob- 
served ELC in the full GCGM we shall consider a the- 
oretically idealized case. In this section we will assume 
that in equilibrium (p t )t is equal to the optimal value 
opt[p t ] given by Eq. 1)16(1 of 1 — / for I < 0.5 and I for 
I > 0.5. During the ELC we will assume that p t oscil- 
lates between values that are greater and less than the 
equilibrium value by a magnitude sufficiently large that 
P[A t = —h t ], given by Eq. (|76|) . takes only the values 
and 1. This yields the following values for P[A t = —ht], 
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FIG. 24: The prediction performance rj t , At and the virtual points Vi-t of an agent i which initially is inactive and has a gene 
values of pt- t = 1.0. The model is the memoryless GCGM with n = 500, / = 0.5, T = 12, r = 0.2 and m = 3. The symbols in 
the margins denote the positions of the step change of \f in Fig. 1251 
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FIG. 25: Various important quantities plotted over the time 
period corresponding to Fig. 1241 a) The prediction perfor- 
mance r/t (measured against the left hand axis) and the prob- 
ability P[At = — 1] that the global action is —1 (measured 
against the right hand axis), b) The deviation of p t from I, 
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FIG. 26: a) The evolution of the numbers of agents for which 
Pi;t < 5 (zero agents) and pi-t > 1 — 5 (one agents) over a 
time period that includes that depicted in Figs. 1241 and 1231 
b) \f over the same time period. The occurrence of ELC 
is indicated by spikes in the time series. The first ELC 
identifiable is that depicted in Figs. 1241 and 1231 
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R<opt| W : P[A t = -ht]=\ l . f/ J = +i R>op t W : P[A t = -h t ] = j * * \ '] (83) 



p t > opt[p t ] : P[A t = -h t ] = ^ 1 :e u _ i 1 (82) As we saw in Sec. IIII D 2l it is sequences of T time steps 

in which A t — —ht or A t = +h t which act as the triggers 
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FIG. 27: Evolution of the number of active agents (also know 
as the volume) and the price (defined by Eq. CT ) over the 
same time period described by Figs. 1241 and l2"51 

for ELC in the full GCGM. We can see from Eqs. (E2> 
and H83JI that A t = — h t with probability P = 1 when 
p t > opt[p t }. Thus, p t > opt[p t ] leads to the sequences of 
time steps which have the same effect on the zero and one 
agents as the sequences of time steps in which A t = —1 
that we saw in the memory less GCGM. The situation 
when p t < opt\p t ] is more complicated. We can see from 
the above expressions that for I < 0.5, A t = —1 while 
for I > 0.5, A t = +1. Thus, for p t < opt[p t ] we expect 
sequences of time steps in which A t = — 1 and A t = +1 
respectively. However, it is not immediately apparent 
that A t = +h t as we might expect. 

By application of the same Markovian analysis that we 
used in Sec. Ill 1 21 to the m = 1 case, we can derive the 
state transition diagrams given in Fig. EHl The state 
labels are as defined by Fig. El We can see from Fig. 
|2Hlthat when p t < opt[p t ] the transition diagrams each 
contain two attractor states in which h t — — 1 and +1 for 
I < 0.5 and I > 0.5 respectively. Thus, when the model is 
in these states P[A t = — h t ] = and therefore A t = +h t . 
The reason that they can be divided into two congruent 
sub-diagrams is that in each case, as we remarked above, 
A t only takes a single value. Therefore, the value of the 
memory bit that corresponds to the opposite value of 
A t has no significance. This leads to the two-fold state 
degeneracy that we observe; states which only differ in 
the value of this attribute are equivalent. Another feature 
to note in Fig. |2HI is that, depending on the state that 
the model is in when it changes from p t > opt[p t ] to 
p t < opt[jj t ], it may take several time steps to reach the 
attractor. Thus, unlike in the case of the memoryless 
GCGM, we should not expect oscillations with period 
T. There will likely be interjected time steps while the 
model finds the attractor. 

We can see that this analysis will apply to the case 
of general m by considering Eq. 1|82|) . For p t < opt[p t ], 
A t — — 1 or +1 consistently. At the first time step after 
the activation and deactivation of one and zero agents the 
history will contain a mixture of —Is and +ls. However, 
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FIG. 28: Markovian transition diagrams for ELC in the full 
GCGM. The state labels are as denned by Fig. [TJ and ± 
signs give the value of ht in each state. 



it is clear that after m + 1 time steps it will contain only 
— Is and +ls for I < 0.5 and I > 0.5 respectively and 
the memory bit corresponding to this history will also 
take the same value. These states in which the history is 
{-1, -1, . . . , -1} and h t = -1 or {+1, +1, . . . , +1} and 
h t = +1 correspond to the attractor states in Fig. 1281 



1. Example ELC 

Now we shall look at a numerical example. Figure |2*§1 
gives the values of the global action A t , the prediction 
h t and the prediction performance r\ t - Once again we 
have also included the virtual points v^t of an agent i 
which is inactive and has pi-t = 1.0 at the beginning of 
the time period shown. Figure 1291 is equivalent to Figs. 
II 91 and l2~4l except that we have additionally included the 
value of the prediction ht . Figure 1301 shows , r\t and 
P[A t — —h t ] given by Eq. H7b|) . The paragraph labels 
below correspond to the labels in Fig. 1291 

a Period a, in which A t ~ +h t , provides the trigger 
which causes the activation and deactivation of zero and 
one agents. 

b,c The (de) activation that is the result of a causes 
a step increase in as we expect. However, the magni- 
tude of Xf during b and c is not sufficiently large that 
P[A t = —h t ] = 1 when h t = +1. In fact, as we see 
from Fig. EOK, P[A t = -fit] « 0.9 when h t = 4-1. Thus 
we can see that in this regard the realization of an ELC 
described here is not ideal in the sense discussed in the 
previous section. It is because P[A t = — ht] ^ 1 that for 
the second time step of b A t = +h t resulting in these two 
interjected time steps. Throughout period c A t = —ht 
as expected. 

d,e,f The interjected period d corresponds to the 
model finding one of the attractor states in which A t ~ 
+h t . During period e, A t = +h t once again. However, 
after the reactivation and deactivation at the end of e 
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w 0. Therefore, by chance A t — +ht for the next 
three time steps as well, resulting in the interjected pe- 
riod f. This extra long period in which A t = +h t allows 
some zero and one agents who had failed to (de) activate 
during the T time steps e to do so. Thus, A^ increases 
and so therefore does P[At = —ht] (for h = +1). 

g,h,i During g, A t = —ht despite the fact that 
P[A t — —ht) ~ 0.7 (for h = +1). h once again corre- 
sponds the model finding the attractor state, i represents 
the final period in which A t = +h t before Xf returns to 
approximately the equilibrium value and the ELC comes 
to an end. 

Note that the periods like c and g, in which the magni- 
tude of Xf is not sufficiently large that P[A t = —h t ] = 1 
when h t — +1, occur with a much greater probability 
that they do in the memoryless model. The reason for 
this is that P[A t = —h t ] = 1 when h t = —1, unless the 
magnitude of the oscillations inp t is so great that A^ ~ 0. 
Therefore, for any time steps during periods like c and 
g for which ht = — 1, At = — ht with probability P = 1. 
We can see this clearly in the plot of P[A t = —ht] in Fig. 

ma 

In Fig. 1311 we show once again the volume nt and 
the price ir t over the same period described by Figs. 1291 
and [301 I 11 Fig. 131b we can clearly see the effect of the 
discussion in Sec. IIII CI Each time that h t = — 1 the price 
rises with probability P = 1. Furthermore the result of 
the deviation of the equilibrium value of p t from opt[p ( ] 
(see Sec. Ill G|l is that it is more probable that the price 
will rise rather than fall when h t = +1. These two effects 
ensure that in equilibrium (between ELC) the price n t is 
an increasing function of time. We can see in Fig. I31lthat 
one of the effects of the ELC described in this section is 
to halt and even briefly reverse this continuous price rise. 

The fundamental reason for this behavior is that, as 
we remarked in Sec. [H] the agents in the GCGM are 
unable to control directly whether they will buy or sell 
at each time step. They can only control the probability 
with which they will follow the prediction h t . As we 
saw in Sec. [H] the optimum behavior for the agents is 
to evolve such that the excess demand will be zero when 
ht takes it's most common value (h t = +1 in the case of 
/ > 0.5). Therefore, in the asymmetric case of / 7^ 0.5, 
the magnitude of the excess demand A will be large when 
h t takes the opposite value. This represents the agents 
mistakenly believing the prediction which in turn leads 
to an excess of buyers or sellers for I < 0.5 and Z > 0.5 
respectively. 

Johansen and Sornette have provided evidence Wl\ 
that large price changes in financial markets are outliers. 
By this it is meant that the frequency with which large 
price changes occur cannot be predicted using the dis- 
tribution of smaller price changes. From the results pre- 
sented in this section, we can see that the large changes 
that occur in the volume during ELC are also outliers in 
this sense. The distribution of volume changes between 
ELC is such that changes of the magnitude observed dur- 
ing an ELC occur with a very small probability. As we 



saw in Sec. IIII Dl a different mechanism (I.e. that of the 
susceptibility of P(p) and the occurrence of triggers in 
A t ) is responsible of the occurrence of ELC, which there- 
fore occur with a much greater probability. 



G. Summary of ELC Characteristics 

We have seen how ELC can occur in the GCGM as 
a result of the susceptibility of the gene value distribu- 
tion P(p) to triggers in the global action time series A t . 
We also saw that between ELC the self-segregation of 
the agents increases the susceptibility of P(p) while this 
process is reversed during an ELC. Furthermore we saw 
that an ELC in the the memoryless GCGM leads to an 
approximately periodic oscillation in a derived price time 
series. In contrast the price time series in the full GCGM 
is a divergent quantity resulting from the inability of the 
agents to control nf ht directly. In both models ELC lead 
to approximately periodic volume nt oscillations. 



IV. CONCLUSION AND FUTURE 
DIRECTIONS 

In conclusion, we have presented a detailed discussion 
of the role played by memory, and the nature of self- 
induced shocks, in an evolutionary population competing 
for limited resources. We have left open several interest- 
ing questions that we hope will be addressed by future 
work. First of all in Sec. IIIII we have only considered the 
case in which the death score D is less than the confi- 
dence interval T. The result of this is that agents mutate 
over a shorter time scale than the period of oscillation of 
an ELC. We would expect that values of D > T would 
lead to ELC that persisted for many more periods. 

We have taken the ratio of the number of points gained 
by an agent when ai-.t = +A t to that lost when a^t = —At 
to be unity. Hod, Nakar and Burgos et al. Refs. 0, S 13 
demonstrated that for values of this prize-to-fine ratio 
R < 1 the self-segregation of the gene value distribution 
observed by Johnson et al.Q is replaced by clustering 
behavior. In this case the agents tend to evolve towards 
Pi-t = 0.5 in equilibrium. This has implications for the 
occurrence of ELC since, as we saw in Sec. IIII D II this 
clustered gene value distribution is not susceptible to the 
trigger sequences that cause ELC. Therefore we expect 
that reducing the prize-to-fine ratio would suppress the 
occurrence of ELC in the GCGM. 

Perhaps most importantly, we showed in Sees. Ill Gl 
and III Hi that for the original Genetic Model in equilib- 
rium p t deviates from the optimal value given in Eq. I|16(l . 
This apparently small deviation has some important con- 
sequences as it determines the values taken by (h t )t and 
the magnitude of the autocorrelations observed in the ht 
time series. We have shown that since this effect is not 
present in the memoryless Genetic Model it does not re- 
sult from the finite standard deviation of the gene value 
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FIG. 29: The prediction performance rj t , At and the virtual points Vi- t of an agent i which initially is inactive and has a gene 
values of p i]t = 1.0. The model is the full GCGM with n = 500, / = 0.5, T = 12, r = 0.2 and m = 3. The symbols in the 
margins denote the positions of the step change of \f in Fig. 1251 
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time stop: t 

FIG. 30: Various important quantities plotted over the time 
period corresponding to Fig. 1241 a) The prediction perfor- 
mance j) t (measured against the left hand axis) and the prob- 
ability P[At = — 1] that the global action is —1 (measured 
against the right hand axis), b) The deviation of p t from I, 
A+. 



distribution as we might expect but must instead result 
from the action of the global memory. We hope that 
future work will provide clarification of this point. 
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